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Abstract 

A two-valued fitness landscape is introduced for the classical Eigen’s quasispecies model. This 
fitness landscape can be considered as a direct generalization of the so-called single or sharply 
peaked landscape. A general, non permutation invariant quasispecies model is studied, therefore 
the dimension of the problem is 2^ x 2^, where N is the sequence length. It is shown that if 
the fitness function is equal to w -I- s on a G-orbit A and is equal to w elsewhere, then the mean 
population fitness can be found as the largest root of an algebraic equation of degree at most 
N + 1. Here G is an arbitrary isometry group acting on the metric space of sequences of zeroes 
and ones of the length N with the Hamming distance. An explicit form of this exact algebraic 
equation is given in terms of the spherical growth function of the G-orbit A. Sufficient conditions 
for the so-called error threshold for sequences of orbits are given. Motivated by the analysis of the 
two-valued fitness landscapes an abstract generalization of Eigen’s model is introduced such that 
the sequences are identified with the points of a finite metric space X together with a group of 
isometries acting transitively on X. In particular, a simplicial analogue of the original quasispecies 
model is discussed, which can be considered as a mathematical model of the switching of the 
antigenic variants for some bacteria. 

Keywords: Eigen’s quasispecies model, single peaked landscape, mean population fitness, regular 
polytope, finite metric space, isometry group 

AMS Subject Classification: I5AI8; 92DI5; 92D25 

1 Introduction. Classical quasispecies model 

A great deal of research on the border between mathematics and biology was spurred by Eigen’s 
quasispecies model, formulated in 1971 in m- This model was suggested to describe the replication 
of prebiotic macromolecules in order to study various aspects of the problem of the origin of life. 
Independently, an equivalent model was suggested to study the change of frequencies of different 
genotypes in haploid multi-allele populations under the evolutionary forces of selection and mutation. 
Standard references to review the classical and recent developments are [21 El m na 18]. We also 
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refer to the introductory sections in [5l[l9l|20] for more details on various issues in the quasispecies 
theory. In the present work we are mostly concerned with some specific mathematical developments 
about the model, which can also describe various systems in population biology or chemical kinetics. 

We start with formulating the model. We assume that we deal with a population of sequences 
of the fixed length N. Each sequence is composed of zeroes and ones, hence I := 2^ being the total 
number of different types of sequences. The sequences can reproduce and mutate to each other. We 
also assume that the reproduction events occur at discrete time moments, and sequence k produces Wk 
offspring on average with the probabilities qjk, where qjk is the probability to produce sequence j by 
the parent of type k. Therefore, qkk is the probability of the error-free reproduction, and qjk = 1. 
Let p G R^, = (po) • • • ,Pi-i) be the vector of frequencies of different types of sequences at the 

selection-mutation equilibrium. Then it follows from the basic theory (e.g., [6]) that p can be found 
as the positive normalized eigenvector of the matrix QW corresponding to the dominant eigenvalue 
A, i.e., 

QWp = Xp. (1.1) 

Here W = diag(t(;o,... is the matrix describing the fitness landscape (note that we count the 

indices from 0), and Q = {qjk)ixi is the mutation matrix, which is stochastic by definition. At the 
equilibrium the dominant eigenvalue A is equal to the mean population fitness X = W := WjPj, and 
the vector p was called the quasispecies by Eigen and his co-authors. 

The basic mathematical problem, given W and Q, is to determine w and p. This problem turned 
out to be very nontrivial and required an introduction of intricate methods of statistical physics, careful 
numerical procedures, and non-elementary mathematical analysis to achieve a partial progress (much 
more detail can be found in [5lll9l|20] and references therein). No general analytical solution exists. 
Moreover, even numerically, there are important obstacles to find w and/or p, most serious of which is 
the dimensionality of the problem, recall that the matrices have the dimensions I xl = 2^ x 2^, where 
N is the sequence length. One particular solution to the problem of dimensionality is to consider 
very special fitness landscapes, such that the average number of offspring is determined not by the 
sequence type (which is the ordered list of ones and zeroes) but by the sequence composition (i.e., 
by the numbers of ones and zeroes in the sequence). Such fitness landscapes are sometimes called 
symmetric or permutation invariant and allow to reduce the dimension of the problem from 2^ x 2^ 
to {N -|- 1) X {N + 1). This worked especially well for the so-called single peaked fitness landscape 
defined by 

W = diag(t(; + s,w,..., w), w > 0, s > 0, 

see [nuniEi] for additional details. Moreover, most limiting procedures when the sequence length 
N tends to oo, were applied to the models with the permutation invariant fitness landscapes, e.g., 
[311171 [20]. At the same time it is clear that the assumption that the fitness landscape is permutation 
invariant should be relaxed at least in some specific biological situations. 

Our first goal in this manuscript is to present an efficient method to reduce the dimensionality of 
the mathematical problem from 2^^ to -|- 1 for some specific fitness landscapes that generalize the 
single peaked landscape but are not permutation invariant sensu the definition given above. These 
fitness landscapes still possess a great deal of symmetry but are much more flexible for assigning 
the fitness values compared to the permutation invariant landscapes. Second, by carefully analyzing 
the obtained algebraic equation for w we are able to give a precise mathematical definition of the 
threshold-like behavior, which is observed in some quasispecies models [22|. We present sufficient 
conditions for the model to demonstrate such behavior. The language of the group theory allows us 
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to recast the conditions for the error threshold to occur into the geometric picture of sequences of 
orbits in the underlying metric space under the action of a given group. Third, motivated by these 
considerations, we introduce an abstract generalized Eigen quasispecies problem, give several specific 
examples, and briefly analyze a simplicial analogue of the original quasispecies model. Despite a high 
level of abstraction of the introduced model, even the simplest mathematical construction describes 
biologically realistic systems, in particular, the switching of the antigenic variants for some bacterisfil- 


2 Notation. The reduced problem 

In this section we introduce the required notation and also list several facts necessary for our exposition, 
additional details can be found in m- Recall that N denotes the sequence length and 1 = 2^. 

Let T be a non-empty fixed subset of indices: A C {0,1,..., Z — 1}. For some fixed re > 0, s > 0 
we consider the two-valued fitness landscapes of the form 

f w + s, k £ A , 

“ I w, k ^ A . 

Thus, the diagonal matrix W of fitnesses can be represented as 

W = wl + sEa = wl + s Ea, (2.1) 

a^A 

I being the identity matrix and Ea being the elementary matrix with the only one nontrivial entry 
Saa = 1 on the main diagonal. 

Consider the eigenvalue problem 

QWp = wp, (2.2) 

where p^ = (po; • • • ,Pi-i) £ is the eigenvector of the matrix QW corresponding to the leading 
(dominant) eigenvalue X = w. The vector p is normalized such that 


i-i 

'^Pk = 'i-, Pk>0, 

k=0 


(2.3) 


and hence the equality 

i-i 

'^WkPk = W + s'^Pa = W (2.4) 

k=0 a^A 

holds. 

For the following we make an additional assumption that the mutations at different sites of the 
sequences are independent and the fidelity (i.e., the probability of the error-free reproduction) per site 
per replication is given by the same constant 0 < g < 1 for each site. Then 

Qjk = (1 - j, /c = 0,..., Z - 1 

somewhat modified version of this text appeared in Bull Math Bio, 78(5), 991-1038, 2016, which 
also includes a nontechnical discussion of the main results. The same discussion can be found also at 
https: //a novozhilov.word press.com/2016/03/25/eigen-quasispecies-model-and-isometry-groups/ 
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defines the mutation matrix Q. Here Hjk is the standard Hamming distance between sequences j and 
k (i.e., the number of sites at which sequences j and k are different). Note that now both the leading 
eigenvalue A and the quasispecies p depend on the fitness landscape and, most importantly, on the 
mutation fidelity q, hence we sometimes denote W = w{q) and p = p{q). 

Using the special structure of the mutation matrix Q, it can be shown (see, e.g, [H]) that there 
exists a non-degenerate matrix T such that 

= yT, T-^QT = jTQT =: D, 

with 

D = diag(l,..., (2g - 1)^^ ..., {2q - 

where Hj is the Hamming norm of the sequence j, i.e., the number of ones in this sequence, Hj := H^j] 
we are using the lexicographical ordering of indices, hence, e.g., Hq = 0 and Hi_i = N. Moreover, 
explicitly matrix T is given through the recursive procedure 

T = T]y, Tfc = Ti (g) Tfc_i, k = 2,3,...,N, 


and 


Here ® denotes the Kronecker product (e.g., m)- 

We write down the indices a, b, where 0<a<Z — 1,0<6<Z — 1, in the binary representation: 


a — ao + «i2 + • • • + aAr_i2'^ ^ — [oq, ai, ... , ckat-i] , G {0,1} , 


b — l3o + /3i2 /?Ar_i2'^ ^ — [/3o, /?i, ... , Pn-i] , h £ {0,1} . 

One additional property of T that we will require in the following is given by the following lemma. 


Lemma 2.1. Let T = {tab)ixi be the transition matrix defined above, Pa{z) 
ating polynomial of the a-th column ofT. Then 


i-i 

tkaZ^ be the gener- 

k=0 


Moreover, 


l-l N-l 

Pa{z) = '^tkaZ^ = n . 

k=0 i=0 

N-l 

tab = tba = (-1)^“’^^ , (a, b) := ^ aufik mod 2. 

k=0 


(2.5) 


( 2 . 6 ) 


Proof. We prove the formulas (12.5p and (j2.6p by the induction on N. Indeed, according to the definition 
of T we have the Kronecker product 


T = T fq = Ti® Tat-i 


1 

1 



® Tn-1 


Tn-1 

Tn-1 

Tn-1 

1 

1 
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Let us represent a = = otv-i + aN-i2^~^. The block form of T = implies the equality 

Paiz) = Paff_i{z){l + (—and (I2.5P by induction. 

Consider again the representations a = ajq = OAr_i + Q;Ar_i2^“^, b = bj^ = + fijq. It 

follows from the above form of the matrix T that 


tab = {TN)ab = (2^Af-l)ajv-i6iv-l(“l)°'^ ^ 


The induction on N completes the proof of (12.6p . 


Let us now return to the problem (12.21) . We have 

T-^QTT-^WTT-^p = wT-^p , 


or, in view of m, 

D ( u;I + s ^ T-^EaT j p-^p = wP-^p , 
\ a£A J 

which yields, after some rearrangement, 

{wl - wD)P~^p = jY^ DPEaP . 

aeA 

Let X := P~^p, p = Px. Then (j2.8p implies 

® = y — wD)~^DPEaP , 

aGA 


(2.7) 


( 2 . 8 ) 


(2.9) 


or, in coordinates, 

^ S (2g - l)^HkaPa 

^ ^ ^ ~ '^( 2 ? “ 1 )^'' ' 

Since p = Px, then we get 




i-i 

Pb 'y ^ 

k=0 


S (2g — l)^'‘tbktak 

1 ^ ^ w — w{2q — l)^fc^“ 
aeA k=0 ' 


( 2 . 10 ) 


( 2 . 11 ) 


Note that only the components pa, where a ^ A, are involved in the right-hand side of p2.1ip . We 
can omit the components pb for 6 ^ A and obtain the “reduced” column-vector = (p^), a G A. 

Considering only a G A we can rewrite (12.lip as 


Pa = Mpa , 

where M = {mab)rxr is the square matrix of the order r = |^| with the entries 

_ _ s ^ {2q- l)^Haktbk _ s ^ (2g - l)'^fc(-l)(‘^'fe)+(bfc) 

I — w{2q — ^ w — w{2q — 


( 2 . 12 ) 


(2.13) 


in view of p2.1ip and Lemma 12.11 The equality p2.12p means that the reduced vector p^ is an 
eigenvector of M corresponding to the eigenvalue A = 1. 
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We consider w in ()2.13p as a parameter. It follows from (12.41) that W depends only on pa, a £ A, 
that is, on the reduced vector p^. The original eigenvector p can be reconstructed from with the 
help of (12.111) if p^ is known. Therefore, for the introduced special two-valued fitness landscapes, 
instead of the original problem (j2.2p . we can consider the problem to find the reduced eigenvector 
p^ satisfying (|2.12p and corresponding to the eigenvalue A = 1 of the matrix M defined in (I2.13p . 
Since (j2.12l) defines pa, a £ A in terms of w, then, finally, formula (12.4p can be used to determine w 
implicitly in terms of the system parameters tc, s and q. 

To conclude, we remark that the eigenvalue w can be also found from the equation 

det(M-I) = 0, (2.14) 

but in general it is not easier than to solve the original problem ()2.2I) . For the single peaked landscapes 
(i.e., when A consists of a single element) the corresponding equation was obtained and investigated 
in [19]. In the next section we propose a different approach that can be further elaborated on for some 
special cases. 


3 Equation for the leading eigenvalue w 


In this section we show that, using the preliminary analysis from the previous section, it is possible 
to find an algebraic equation for the eigenvalue w under some additional symmetry requirements on 
the set A, and this equation is of degree at most iV -|- 1. 

First we transform (j2.13p in the following way: 


_ s {2q — l)^'^taktbk _ s {2q — l)^'“tafc^fefc 
7^tc-rc(2g- l)^fe /u; ^ 1 - |(2g - l)^fe 


l-i 


OO 1 — 1 

S /W\^ ' 


(3.1) 


i E E (i) - F-'-- ^ i E (i)T(^<i - . 


fc=0 c=0 

Lemma 3.1. We have the following factorization: 


c=0 


A:=0 


1-1 

E 

k=0 


Z^'^taktbk = (1 - 2:)^“'’(1 + Z) 


N-H,b 


(3.2) 


l-l l-l 

Proof. It is straightforward to see that z^'^takhk = 17 takZ^’^tkb is the entry Zab of the matrix 

k=0 k=0 

Z :=T diag(l,..., ..., z^)T = 2^T diag(l,..., ..., z^)T-^. 

It follows from the properties of T that 

Zab = 2^(1 - q)^o.bq^-Hab = (1 - z)^“<>(l -I- where q = ■ 

Thus, the lemma is proved. ■ 
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Applying Lemma 13.11 to (|3.1I) we get 

OO 

mab = -^Yl (=)' (1 - (2? - 1)'^+')^“" (1 + {2q - 1)'=+!)'^-^“^. (3.3) 

We have the following consequence of (I2.12p : 

rribaPa = '^Pa'^ ^ba- (3.4) 

6Gj4 ciG.A d^A b^A 

Now we introduce a A;ey assumption that will allow us to simplify the analysis. 

We assume that the sum ^ niah does not depend on a E A. Then it follows from (13.4p that 

feeA 

Yh '^ha = 1 for each a E A. In view of p3.3p it implies that 

bGA 


Iw 

s 



j2{i-{2q-ir+Y‘'>> 

beA 


{1 + {2q - 


(3.5) 


if the inner sum does not depend on a E A. 

The main question is when our key assumption holds. We present some sufficient conditions for 
this, and hence for the equation (13.5p . 

We refer to the well known geometric interpretation of the metric space V = Vv = {0,1}'^ with 
the Hamming distance. Consider 1-skeleton of the Wdimensional cube [0,1]'^ with the set of vertices 
V. The vertices a and b are connected by the (unique) edge Cab if Hab = 1- The Hamming distance 
between vertices u and v is the length of a shortest path connecting these vertices, that is, the number 
of edges in this path. The set V, due to the binary representation 

a = ao + «i2 + • • • + aAr_i2'^ ^ = [oq, ai, ... , (Uat-i] , ak E {0,1} , 

can be identified with the set of indices X = = {0,1,..., 2-^ — 1} with the Hamming distance. In 

what follows we will usually make no difference between metric spaces V and X. 

We note that the group Iso(AAr) of all isometries of Ajv, acting on the set Xjy, is also known as 
the Weyl group Wn of order 2-^ A! of the root system of type B]\f (or Cat, see, e.g., m- 

Proposition 3.2. Let G be a group that acts on the metric space X by isometries {i.e., G ^ Iso(A)) 
and let A be a G-orhit. Then the equality dHS]) holds. 

Proof. Since G acts transitively on A and preserves the Hamming distance Hab, the inner sum in (13.5|) 
does not depend on o E A. ■ 


Now we can state the following basic result. 

Corollary 3.3. Under the eonditions of Proposition\3fM the eigenvalue w = w{q) of (j2.2p is a root of 
an algebraie equation {with the coefficients depending on q) of degree at most A + 1. 

Proof. Consider the polynomial (see (13.2p and Lemma l3.II) 

PAz) ■■= w ■ (3.6) 

b£A 
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We have 


(3.7) 


Fa{z) >0, -l<z<l, Fa(0) = ^ , Fa{1) = 1. 


Since 0 < < N, we can rewrite 


as 


1 ^ 


N-d 


d=0 


where fd = #{& E A \ Hab = d}. Applying the binomial expansion to (j3.8[) we get 


N 


FA{z) = Y,hdz\ hd = ^Y.^-iyfi 


2N 

d=0 j=0 

With the introduced notation equation (I3.5p reads 


d\ fN - d 


jj \ d-j 


w 

s 


E(S)A.((2,-1) 


c=0 


c+l'l 


or 


w 

s 


N 


N 


EM2,-i)^E(i) (22-ir' = E 

d=0 c=0 d=0 

Finally, the last equation can be transformed into 


hd{2q - 1)<^W 
w — w{2q — ' 


N 

E 

d=0 


d 


1 


hdi2q - 1 ) 

w — w{2q — 1 )'^ s 


with the rational coefficients hd defined in (|3.9p . 


(3.8) 


(3.9) 


(3.10) 


(3.11) 


We have the following corollary that allows us to reduce the number of computations in some 
special cases. 

Corollary 3.4. Let V = Fw = Iso(Xjv) be the group of all isometries {of order 2 '^A^!) acting on the 
metric space X = Xj^ = {0,1,..., 2'^ — 1} with the Hamming distance and let 7 A be the image of A 
under the (left) action 0/7 G F. Then the equations (13.5D . (13.101) . (13.IIP , and (I3.12P are the same for 
A and 7 A. 

Proof. The equations (13.5P . (j3.10P . (13.IIP , and (I3.12p were obtained only on the ground of the metric 
properties of A. Note that A is a G-orbit if and only if 7 A is a yGy^^-orbit. Thus, the (left-)acting 
group G should be substituted by the conjugate 7 ^ 7 “^. ■ 


The case w = 0 corresponds to the lethal mutations. In particular, we have 

Corollary 3.5. If w = 0 then we get the following polynomial expression for the leading eigenvalue, 
where a ^ A may be chosen arbitrarily in the G-orbit A: 

N 

W = s ^(1 - q)^<^bqN-H,, ^ sFA{2q - 1) = s ^ hd{2q - . 

bGA d=0 
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4 Examples and applications 


In this section we consider several simple examples of the two-valued fitness landscapes and apply the 
obtained equation for the leading eigenvalue. The examples we consider are mostly based on various 
subgroups of the symmetric group Sn- 

The symmetric group G = S]\f acts on the metric space X = Xj^ = {0,1,..., 2^ — 1} with the 
Hamming distance by isometries. To wit, let u G S'w. Then 

rr(n) (j[(ro, rri,..., rrw—i] (o) i ? • • •; (A^—i)] ? ^ {d, 1} . 

Note that G = Sn is a proper subgroup of T = Iso(XAr). The latter is of the order 2^Nl and contains 
also the elements that correspond to reflections of the iV-dimensional cube [0,1]'^, see, e.g.. Example 
14.71 below. 

The Sw-orbits are the subsets of 


Ap = {a £ X\ Ha = p} , p = 0,l,...,N. 

Example 4.1 (General permutation invariant fitness landscapes). Recall that we defined the per¬ 
mutation invariant fitness landscape to be a diagonal matrix W such that the elements on the main 
diagonal are Wj = wHji he., the fitness of the sequence j depends only on the total number of ones in 
this sequence. To satisfy this definition the orbit for the two-valued fitness landscape must coincide 
with one of Ap defined above. 

We can consider only the case 2p < N. Indeed, let 7 (a) = a* = I — 1 — ahe the index conjugate 
to a. The conjugation 7 is an involution in T. The binary representation of o* differs at each position 
from that of a. Then Ha* = N — Ha and a G Ap 4^ a* £ A* = An-p- In other words, according to 
Corollary 13.41 the equations ()3.5p . (jd.lOp . (j.S.llj) . and (j3.12p for Ap and = An-p are the same. 

To obtain an equation for w we will need an auxiliary 

Lemma 4.2. For a,b £ A = Ap the distance Hat is even. Moreover, for each A: = 0,1,... ,p 

#{6 G Hp I Hab = 2k} = Q ( ^ • 

Proof. If Ha and Hb have the same parity, in particular, coincide then Hab is even, hence Hab = 2k. 
The binary representations 


a = [ao, oil, ... , aN-i ], b = [/3o, fdi, ... , Pn-i] , aj,Pj £ { 0 , 1 } 


differ at exactly 2k positions. Thus, in order to obtain the binary representation of b from that 
of a we need to substitute exactly k ones in [ao, ai, ... , aw-i] by zeroes and exactly k zeroes in 
[ao, ai, ... , aw-i] by ones since the total number of ones in both representations of a, b is equal to 
p, Ha = Hb = p. There are (^) variants of such substitutions. ■ 

Lemma 14.21 applied to ()3.5p yields 


w 

s 



c =0 k=0 





(1 - {2q - (1 + {2q - iy+^)^ 


(4.1) 
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We note that in (14.111 we can disregard the restriction 2p < N. The polynomial 

=5!^ E Q (1 - =E ('‘■2) 

fc=o ^ ^ ^ ^ d=0 

of degree N satisfies the conditions (j3.7p . Moreover, = h^-d- 

Therefore, we conclude that for the permutation invariant fitness landscapes we obtained the 
explicit equation (13.111) for the leading eigenvalue w with hd defined in (14.2p . While it is a common 
wisdom that the dimensionality of the quasispecies problem for the permutation invariant fitness 
landscapes can be reduced to A^ +1 from 2^, the explicit equation to determine the leading eigenvalue 
w is, to the best of our knowledge, new. 

Remark 4.3. For the permutation invariant fitness landscapes arguably the most transparent and 
efficient way to analyze the problem is to invoke the so-called maximum principle laiiTiEs], therefore, 
first several examples in this section should be mostly considered as an illustration of the suggested 
technique. Nevertheless, the results we present are exact, contrary to the approximate nature of the 
maximum principle, for which also some technical conditions on the fitness landscape must be imposed 
(without these conditions the maximum principle can lead to incorrect conclusions, e.g., [20 ( 123] 1. Our 
equations work for any fitness landscape and therefore are of interest on their own. 

In what follows we consider several special cases of the previous example. 

Example 4.4 (Single peaked landscape). Let p = 0 in the previous example. Then we deal with the 
classical single peaked fitness landscape. The equation for w was studied in great details in [T^. We 
would like to mention that in view of Corollary 13.41 since the group of isometries T acts transitively 
on the set X of indices then each equation (13.111) for the single peaked landscape A = {a} is the same. 
Consequently, for the leading eigenvalue we can consider the basic case ^0 = {0}, that is, the single 
peak at rco = rc -|- s. 

We can also consider the trivial group G = {1} acting on X in order to treat the same case. 

The polynomial (13.8p becomes 

d=0 ^ ^ 

Hence, the equation (|3.11l) reads 


^ ^ fN\ {2q-lY ^ 1 
2^ ^ \ d J w — w{2q — l)*^ s 


(4.3) 


A very similar expression for a slightly different model was obtained originally in |12j . 


Example 4.5. In the previous notation let A 


Hab = 


Ai = {1, 2,4,8 ,... , 2-^ ^}. Now, for a,h ^ A 

0 , a = b , 

2 , a ^ b . 
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The calculation of the polynomial (13.81) yields 

^Aiiz) = ^ ((1 + + {N - 1)(1 - zf{l + z)^~‘^) = ^ ^-iV ^ 

d=0 ^ ^ 

Hence p.lll) transforms into 


^ A {N - 2df /'N\ {2q - l)'^ _ 1 

^ ^ \d) w — w{2q — 1)^ s' 


(4.4) 


Example 4.6. Let iV = 2n be an even number and let A = An- Applying Lemma [4.2l to (13.5|) we find 


w 

s 


wt{=yt("S (^-(2,- irA" (1+ (M - ir‘) 

c=0 k=0 ^ ^ 


2n—2k 


(4.5) 


Example 4.7 (Antipodal landscape). Consider the set A = {a, a*}, where, as before, a* is the 
conjugate index, a* = I — 1 — a. Let G = { 1 ,( 7 } be the group of order 2 whose nontrivial element 
(involution) g maps each a G A to the conjugate a*. Thus, the set A = {a, a*} is a G-orbit. In view 
of Proposition 13.21 since Haa = 0 and Haa* = A then the equation (13.5p reads 


7 = E (i)“ ((1 - - ir r+(1+(2, - ir r). 

c=0 

In this case the polynomial (13.8p takes the form ([-J stands for the integer part): 

FA{z) = ^{{l + .f + {!-.)”) = ^ E L)^- 

d=0 ^ ' 

Hence the equation (|3.1ip is of degree YN/2\ + 1: 


1 

2 ^ 



{2q - 

w — w{2q — 1)2'^ 


1 

s 


(4.6) 


(4.7) 


Example 4.8 (Quaternion landscape). According to a well known theorem of Cayley each (finite) 
group G is a permutation group (which acts on itself by, for instance, left shifts). It follows that each 
finite group G can be embedded into symmetric group n = |G|. Since Sn acts on the set of indices 
X = Xn then we can find many G-orbits restricting on G the canonical action of Sn on Xn (see the 
beginning of this section). Moreover, since there are standard embeddings Sn -A- Sn+i -A Sn +2 -A ... 
there is no problem to construct the action of any finite group G on the set Xjsi for N > n. This gives 
us a virtually unlimited list of the two-valued fitness landscapes, which are not permutation invariant. 

For instance, let 


G = Qs = {±1, ±i, ±j, ±k\i‘^ = f = = -1, ij = k, jk = i,ki= j} 


(—1 commutes with each element of Qg) be the classical quaternion group of order 8. The embedding 
Qs -A Sg is chosen so that i —>■ (0213) (4657), j —>■ (0415) (2736). 
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Consider a G-orbit, say, 


A = {7,11,13,14,112,176, 208, 224} C Xjv , N> 8 . 


Direct calculations yield the polynomial (|3.8p 


Fa,n{z) — ^((1 + z)^ + 3(1 — z)‘^{l + z)^ ^ + 4(1 - z)^{l + z)^ . 

For N = 8 we have 

Fa,?,{z) = ^((1 + zf" + 3(1 — z)‘^{l + z)^ + 4(1 — z)^{l + = 

= ^(2 +z + Uz^ + 15^^ + 15/ + 14/ + z'^ + 2z^). 

64 

Finally, we obtain the following form of (|3.1in 

A Rd{2q - If _ 64 
^^w — w{2q — iy s ’ 


(4.8) 


(4.9) 


(4.10) 


where Rq = Rg = 2, Ri = R^ = 1, R 2 = Rq = 14, Rg = R^ = 15, R 4 = 0. 

Examples of calculating w are given in Fig. 14.11 where the case N = 8 was also checked numerically 
using the full matrix QW. 




Figure 4.1: The leading eigenvalue W depending on the fidelity q for the two-valued fitness landscape 
with w = 2, s = 5 and the set A as in Example 14.81 (a) N = 8 (this case was also checked numerically, 
using the full matrix QW), (6) N = 50. 


Example 4.9 (Lethal mutations). If tc = 0 the calculations can be significantly simplified (see (I3.12h ). 
Moreover, we can find not only the polynomial expression (|3.12p for the leading eigenvalue provided 
^4 is a G-orbit, G ^ Iso(X), but we can find the eigenvector p (the quasispecies distribution) as well. 
On substituting re = 0 into (|2.ip we obtain 

W = sEA = sY,Ea. (4.11) 

aeA 
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Here Ea is the diagonal matrix corresponding to the projection on the orbit A, E'a = Ea- The 
problem (12.211 can be transformed now as follows: 


sQEaP = w{EaP + (p-Eap)) {=wp). 


(4.12) 


Multiplying (14.1211 from the left by Ea and taking into account that Ea is a projection matrix we 

find _ 

w 

EaQEaP = — Eap- (4.13) 

s 

Note that if we omit zeroes in EaP, we obtain the reduced vector Pa introduced in Section [2j 
Direct calculations and Lemma O show that if we take 

£;aP = 0(0,...,0,1,0,...,0,1,0,...), 0>O, (4.14) 

where the ones stand only for the indices a & A, we get 

EaQEaP = FA{2q-l)EAp. (4.15) 

Let us compare (I4.13h and (I4.15p . In view of (I3.12h we conclude that the vector p satisfying (I4.14p is 
a solution of the problem ()2.2I1 provided W = sFA{2q — 1) (possibly not unique). 

The equality ()4.12ll implies, regardless of 0, that 

P = =QEaP = , ^ ^ QEaP, (4.16) 

w FA{2q - 1) 

where EaP has the form (14.1411 . The normalizing factor 0 should be chosen in such a way that (|2.3I) 
holds. Thus, in coordinates we have 

1 

Pk = 


\A\-FA{2q-l) 


^(1 _ 


fc = 0,...,2^-l, 


b^A 


or 


Pk = 


^ (1 _ q^HkbqN-Hkb 

1 b&A 


, A: = 0, ...,2^-1, a^A 


(4.17) 


|H| X: (1 - q)^<^»q^-^'^>’ 

beA 

The expressions (|4.17l) imply that the distribution p is constant for any fixed q on the G-orbits in the 
set of indices A. 

Using the discussed approach, for Example 14. II and w = 0 we obtain from ()3.12p that 

p 


w = s 


E 

k=0 


p\ (N — p\ ^\2k^N-2k 


(1 - qYW 


In Example 14.61 {N = 2n) we find 

. 2 

w = sY, 

k=0 

In Example 14.71 we have 


(1 - qy^q 


lN/2\ 


2k Jin—2k 


W = 


2N-1 


E 

d=0 

Other examples can be treated similarly. 


N 

2d 


(2g-l)2'^ = s(g" + (l-g)^^). 


- ( 2^7 - 1)2 


N 


when n ^ 1. 


\N\ 


(4.18) 


(4.19) 


(4.20) 
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5 The infinite sequence limit N 


oo 


In Corollary 13.31 we obtained the algebraic equation (|3.1ip of degree at most + 1 for the leading 
eigenvalue w = w{q). The advantage of having a polynomial equation of degree N+1 notwithstanding, 
solving (13.111) becomes complicated as N ^ oo. Moreover, it is well known that at least for some 
fitness landscapes (including the classical single peaked fitness landscape) the phenomenon of the error 
threshold is observed: there exists a critical mutation rate q, after which the quasispecies distribution 
p becomes uniform. This phenomenon is usually identified with a non-analytical behavior of the 
limiting eigenvalue TU when N —>■ oo, a general idea can be grasped from Fig. 14.1b . where it is seen 
that there exists a corner point on the graph of the function w. 

In this section we propose several steps to rigorously define and analyze this kind of behavior in 
terms of sequences of orbits An that determine our two-valued fitness landscapes. First, we find some 
bounds for the function w provided 0.5 < q < 1. Next, we restrict our attention at the special class of 
sequences {An)^=no^ which we call admissible and of the moderate growth (here no is a sufficiently large 
natural number). Finally, among all those admissible sequences of the moderate growth we identify 
the ones that demonstrate some kind of non-uniform convergence for the corresponding sequence of 
eigenvalues 

5.1 Lower and upper bounds on w{q) 

First we note that for our purposes it is easier to deal with the series ()3.10p rather than (13.lip . We 
also make the following substitutions 


w = us , w = us. (5-1) 

Then (|3.10l) turns into 

OO 

“ = E(=)°^-‘>«2«-i)“‘"‘)- P-2) 

c=0 ^ 

where the polynomial Fa{z), defined in (13.61) . can be represented in the form (13.9p . 

From Example 14.91 we have that sFA{2q — 1) = w{q) is the leading eigenvalue if re = 0. It was 
proved in jT9] that w{q) increases on the segment 0.5 < q < 1. Therefore, on this segment we have 
the non-increasing sequence (for any hxed q) 


FA{2q - 1) > FA{{2q - 1)^) > • • • > FA{{2q - If) > FA{{2q - 1)'=+^) > • • • > 0, 


(5.3) 


since FA{{2q — 1)'^) > 0 according to (13.7p . Hence, 

OO C 

-!)«)< Fx(2, - 1) 


u = 


c=0 


c=0 


u\c _ FA{{2q - l))u 
u/ u — u 


(5.4) 


It follows that u < u + FA{2q — 1), or 


w{q) <w + sFA{2q - 1) =: Wup,i{q)- 
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A second upper bound can be obtained as follows: 


u = 


< 


E (=) - 1)"‘) = - 1) + E (=) - !)'+■) 

C=0 C=1 

FA{2q - 1) + FA{{2q - if) £ = ^a(2 (? - 1) + 


C=1 


u — u 


Solving the quadratic inequality we get 

u + FAi2q - 1) + + FA{2q - 1))2 - 4u(FA(2g - 1) - FA{{2q - l)^)) 


u < 


or, 


Wup,i{q) + Jwlp fq) - Aw{sFA{2q - 1) - sFA{{2q - l)^)) 

w{q) < -r-=: Wup, 2 {q)- (5.5) 


Remark 5.1. In view of ()5.3p Wup, 2 {q) < Wupa{q)- 

To obtain a lower bound on w{q) we use the approach applied in [19]. Since w{q) increases on the 
segment 0.5 < g < 1 therefore 

s|A| 


w{q) > r(;(0.5) = w + 


2N 


(5.6) 


By the definition of (137 


F^((2g-l)‘=+i) = ^ 

beA 


l-{2q- 1)'"+^ \ /1 + {2q - 1)'=+^ \ ^ 


> 


l + {2q- 1)^+1 ^ fl + {2q-l)\ 


= qic+l)N 


since a € A, Haa = 0 and the function f{t) = is convex (downward) on the segment [0,1]. 
Now from (|5.2p 


c=0 


°° c — N 

u > —— , or u > (u + l)q^ , or u; > (tc + s)q^. 

^ V u / u — uq^^ 


Combining (15.611 and p5.7p yields 


s|A| 


w{q) > max ( w + {w + s) ) =: wiow{q) ■ 


(5.7) 


(5.8) 


Thus we have proved 

Proposition 5.2. For the leading eigenvalue w{q) of (12.2p in the case of the two-valued fitness land¬ 
scape we have 

wiowiq) < w{q) < Wup, 2 {q), 0.5 <q< 1, 

where wiow{q) is given by (|5.8p . and Wup, 2 {q) is given by (15.511 . 

A numerical example with the obtained bounds is given in Figure 15.11 
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Figure 5.1: The lower and upper bounds on the leading eigenvalue w in the case of the quaternion 
landscape lExamole 14.8p . (a) N = 8, (b) N = 50. 


5.2 Admissible sequences of orbits 


To make a progress in analyzing the limit behavior of our system when N ^ oo we introduce in this 
subsection two dehnitions in terms of which this behavior will be described. 

From the previous subsection, we see that the curve w = wiow{q) has a corner point on [0.5,1], 
which we denote g*: 


{N) 

q* = q* - 


N ws\A\2 ^ ff u\A\2 ^ 


w + s 


u + 1 


IwjO.b) 

w{l) 


(5.9) 


The function wiow{q) is constant for 0.5 < q < q* and increases for g* < g < 1 (see Figure ISTl) . It was 
shown in m that for the single peak landscapes (j^j = 1) the lower bound wiow{q) provides a close 
approximation for w{q) for sufficiently large N. Our goal is to generalize these results on the case of 
the two-valued fitness landscapes. 

From this point on we shall use n as the index, which tends to infinity. In most cases it actually 
coincides with the sequence length N, albeit not always, hence the choice of notation. 

One of the main underlying questions concerning the quasispecies model and especially its infinite 
sequence limit, is how actually the fitness landscape is scaled when N ^ oo. In most works in literature 
a continuous limit is used, which basically narrows the pull of the allowed fitness landscapes to the 
ones which have, given this continuous limit, a limit fitness function, which must be also continuous 
(e.g., mm)- Here we take a different approach by specifying sequences of orbits on which 

the fitness landscape is dehned. The sequences that are of interest to us will be called admissible. 

Suppose that for any n > uq a sequence of G,i-orbits A^ G A„ is given, where ^ Iso(A„). 
When n —)• oo the group Iso {Xn) will be always viewed as a subgroup of Iso (Xn+i). More precisely, let 
g € Iso (Xn) be a fixed isometry and let a G Xn+i be represented as a = a„ -|-a„2"' where an, an G Xn- 
Then g, viewed as an element of Iso(A„_|_i), maps a G Xn+i to g{a) := g{an) + g{an)2^. In other 
words. Iso (Xn) as a subgroup of Iso (Xn+i) is acting on the “upper” hyperface x {1} of the cube 
Vn+i = {0, 1}”+^ = 14 X {0, 1} in the same way as it acts on the “lower” hyperface 14 x {0} = I 4 . 
Thus, we have the ascending chain 


Iso {Xno) < . . . < Iso {Xn) < IsO (A„+l) < . . . 


16 













and the corresponding ascending chain of symmetric subgroups 

'S'no < Sn ^ < . . . . 


For a fixed w > 0 consider a sequence of landscapes {'w^'^^)n>no such that = w + s if k ^ An 
and = w otherwise. The sequence {An)^=no parameters w, s, and u = w/s define the 

corresponding family of leading eigenvalues (q) , which are solutions of (I2.2p , and the family 

such that /s. 

In [19] it was proved that for any n > uq the function v!^'^\q) has the following properties: 

1. The function u^^\q) increases on the segment [0.5,1] and is convex (downward) there. 

2. uW(0.5) = u + ^, n(")(l)=u + l. 

2n 

Definition 5.3. A sequence {An)’^=no of Gn-orbits is called admissible if the corresponding sequence 
of values of polynomials FA„{2q — 1) in (13.81) is non-increasing for each q G [0.5,1]; 

n n+l 

FAj2g -l) = Y, /T <1 - «)■'«""' S E fF" (1 - = Fx„„{2, - 1), n > n„. (5.10) 

d=0 d=0 

Definition 5.4. A sequence {An)'i^^no of Gn-orbits is called a sequence of the moderate growth if 

lim J—^ = 0 , or = o( 2 ”), re —>■ oo . (5T1) 

n—¥oo 2 ^ 

To show that our definitions make sense we state 


Proposition 5.5. In all the examples of Section^the corresponding sequences of orbits are admissible 
and of the moderate growth. 

Proof. See Appendix O ■ 


Consider a sequence of G,i-orbits. Our next aim is to investigate what happens with the 

corresponding family as re —>■ oo. 

Proposition 5.6. If {An)^^no admissible sequence of Gn-orbits then for each fixed q G [0.5,1] 

the sequence {u^'^\q))'i^=no ® non-increasing sequence as n ^ oo. If, additionally, is a 

sequence of the moderate growth then lim re(”)(0.5) = u and lim «("■)(!) = re + 1. 

n—>-oo n—>-cxD 

Proof. The second assertion follows directly from Property 2 of {q) above. Let us proof the first 
one. The equation (15.2p for re 7 ^ 0 can be rewritten in the form 


re = 






ta) = E 


c=0 




c+l 


FAn{i2q-ir+^)=J2 


m=l 


U 


u^'^\q) 


FA^m-ID- (5.12) 


It follows from Definition 15. 3l that at each fixed point q G [0.5,1] the sequence of positive coefficients 
{Fa„ ((2q—l)'^+^))„>„g is non-increasing for any c-|-l G N. But the left-hand side re of p5.12p is constant. 
This implies that {u^'^\q))n>no must be a non-increasing sequence for each q G [0.5,1]. ■ 
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Hence we can conclude that the curve u = {q) always passes under the curve u = (q) in 

the rectangle {0.5 <q'<l, n<u<n + l}if {An)^^no admissible sequence of G,i-orbits, see 

Figure ED 

Proposition 15.61 and Property 1 of (q) yield 

Corollary 5.7. If is an admissible sequence of Gn-orbits of the moderate growth then for 

any fixed e € (0, 1] there exists iVo G N such that for any n> Nq the curve u = vf'^\q) intersects the 
line u = u + e at a unique point q^^^ (e, u) G (0.5,1]. 

Note that by virtue of (|5.2I1 . (15.121) . and (13.lip the value u) from the previous corollary can 

be found from one of the following equations 


OO / \ c 

“ + " = E(^) rA.{(2ci-ir+^) 

or, 

— l)*^ _ ^ 

u-\-e — u{2q — 1)^ 

Another almost immediate result is given in the following 


(5.13) 


(5.14) 


Proposition 5.8. If {An)n>No is an admissible sequence of Gn-orbits of the moderate growth then for 
fixed (e,u) the sequence {q^'^\£,u))n>No is non-decreasing as n ^ oo and the inequality 


q^^He,0) < q^'^\e,u) < 


u + e 

M + 1 


< 1 - 


1 — e 
n{u + 1) 


(5.15) 


holds. 

Proof. The upper bound (see Section ED) u = ~ '“ + -^A„(2g — 1) gives rise to the lower bound 

in ()5.15p since the equation u + e = u + FA„{2q — 1) is equivalent to ()5.13p when u = 0. The lower 
bound (|5.7p u = (u + l)g” provides the upper bound in (j5.15p . 

Since u = {u + l)g” is convex downward if g G [0.5,1] and u = u + 1 — n{u + 1)(1 — g) is the 
equation of the tangent at g = 1 to the curve ¥ = (u + l)g" then we get the last inequality in (I5.15p . 
Note that the curve u = (g) has the same tangent at g = 1 (see, for instance, m)- ■ 

The obtained results are illustrated in Figure EH 


5.3 Threshold-like behavior 

In this subsection we define rigorously what we call the threshold-like behavior and provide sufficient 
conditions for the sequences of admissible orbits to possess this kind of behavior. The main conclusion, 
which can be stated in a form of a conjecture, emphasizes the role of geometry for the threshold¬ 
like behavior to occur. Loosely speaking, if the admissible sequence of orbits “looks like a point” 
asymptotically, i.e., basically indistinguishable from the single peaked landscape in the infinite length 
limit, then the threshold-like behavior is observed. We conjecture, as numerical experiments show, 
that the opposite is true: If asymptotically the admissible sequence of orbits is different from a point, 
then there exits no threshold-like behavior. 
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Figure 5.2: The curves in the coordinates q, u defined by (from top to bottom): u = u+Fa„ (2g—1), u = 
TF{q), u = TF~^^{q), u = {u + The points of intersections of these curves with the dotted line 

u = u + e define the values q^'^\e,0), q^'^\e,u), q^'^~^^\e,u), 1 —respectively, see also (I5.15p . 


Let us introduce the notation 


M ! \ 

q* [£,u) = 


from where 


It follows that for a fixed u > 0 


lim n(l — qi^\e,u)) = log 




£ 4-0 n->-QO 


U + £ 

ri + 1 ’ 

(5.16) 

, u + l 
= log . 

U + £ 

(5.17) 

.X 1 U+l 

')) = log 

(5.18) 


u 


It is known (e.g., m) that for the single peaked landscape the curve u = (q) passes very close 

to the lower bound u = max{rt, {u + !)(?"'} in such a way that 


,(”>t,«)=5i">(E,a)-o(i)= 


— o 


as n —)• oo (from (|5.15p we have the inequality q^^\e, u) < q^f’ie^u)). 

Our next aim is to investigate what happens with the curve u = u^'^\q) as n ^ oo. It is more 
conveniently done in coordinates x, L, defined by 


X , n _ , u + 1 


q = 1 -, 0 <x<—, u= {u + l)e , 0 < L < log 

n 2 u 

We will assume that tt > 0 in (j5.19ll . Hence the curve u = u^'^\q) transforms into the curve 

Ln{x) = log(tt + 1) - logU^”^ . 

Note that Ln(0) = 0 since u^"'^(l) = u + 1 for any n. 


(5.19) 


(5.20) 
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Definition 5.9. W& say that an admissible sequence {An)n>no of the moderate growth, or, equivalently, 
the family {u^'^^)n>no possesses the threshold-like behavior on the segment [0.5,1] if for each fixed x > 0 
and the corresponding functions Ln{x) it is true that 


lim Ln{x) = L{x) 

n—>-oo 


X, 0 < X < log 

log^, x>log^. 


The definition above is illustrated in Fig. 15.31 


(5.21) 



Figure 5.3: The limit function L{x) in Definition 15.91 of the threshold-like behavior 


If the threshold-like behavior is present in the two-valued fitness landscape, then the following 
formula provides an approximation for the threshold mutation rate q*^\u), 1: 


(n), , . 1 , U+l 1 W + S 

ql M n) « 1-log- = 1-log- 

n u n w 


(5.22) 


which, of course, coincides with the classical estimate for the error threshold for the single peaked 

landscape mm- 

If the sequence of continuous functions (ri^"'^)n>no has the threshold-like behavior then it converges 
not uniformly on [0.5,1], as n ^ oo, to the discontinuous function 'ijj{q) such that V’(9) = u if 
0.5 < g < 1 and V'(l) = n -|- 1. 

The following theorems and corollaries provide sufficient conditions under which an admissible 
sequence of orbits of the moderate growth shows the threshold-like behavior. 


Theorem 5.10. In the above notation suppose that for n > uq an admissible sequence of Gn- orbits 
An C Xn {Gn ^ Iso(X„)) of the moderate growth is given and u > 0. Suppose also that for n > uq 
the inequality 

Fa„ {2q - 1) < {2q - l)"/^ + M, , 0.5 < g < 1, (5.23) 

is satisfied for some constants Mn such that lim Mn = 0. Then the sequence {An)n>no shows the 

n—yoo ~ 

threshold-like behavior on the segment [0.5, Ij. 
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Proof. In view of equation (15.21) . in coordinates x, L: 


{u + ^ 

therefore (putting m = c + 1) 

OO 

E 


su.Ti 5 ==)''-((-r': 


U = 


m=l 


M + 1 


^mLr,{x)p 


i-^Y" 

n / 


(5.24) 


The lower bound (15.7p . u>{u + l)q^, implies for x G [O, log , n 1, 


Lnix) = log . ^ -< —n log f 1- 

mW( 1 V n 


Consequently, we have on [O, log 

limsupLn(3;) < — lim nlog (1-) = x. 

n .—V Tl / 


(5.25) 


(5.26) 


On the other hand, the function FA„{2q — 1), as the leading eigenvalue for n = 0 (see ()3. 121) 1. is 
increasing on the segment [0.5,1]. In view of the inequality 1 — t < we have 

Make the substitution 2q — 1 = into (I5.23p . where 0 < a: < +oo. Then the following inequality 




1 - 


2x 


n 


2\ 

j ^-2mx/n^ ^ pn{Lr,(x)-x) ^ pj^pnLn{x) 


holds. Hence, (|5.24p yields 


m=l 




ue 


Ln(x) 


m=l 


u + l 


In view of (j5.25l) both progressions in the right-hand side converge for x G [0,log and n 3> 1. 
The simplification provides the inequality 

. < Mx)-x , e^"(^)(n + l-ne^"(^)-^) 

~ "" (tt + l)(ri + 1 — 

Since —?• 0 and the inequality (j5.25p holds we get finally lim inf gL„(x) a: > ^ 


liminfL„(a:) > x. (5-27) 

n^oo 

It follows from (j5.26p . (j5.27h that lim Lnix) = a: if 0 < x < log^^^^. The increasing functions 

n^oo “ 

Ln{x) cannot exceed the value log Thus, the threshold-like behavior is observed. ■ 
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Corollary 5.11. The sequence of constant single peaked landscapes An = {a}, n > uq possesses the 
threshold-like behavior. 

Proof. Condition (I5.23P of Theorem 15.101 reads as follows: the inequality 

< i2q - 1)^/2 ^ 0.5<q< 1, (5.28) 

holds for some constants Mn such that lim Mn = 0. 

n—>-oo 

Let us show that the constants 

-Y~' < ^ 

nJ ~ e{n — 1) 

ht. Consider the auxiliary function ipniQ) = q'^ — {2q — . Its maximum on the segment [0.5,1] 

is reached at some point < 1. This point is a root of the equation 

- niflq - l)"/^-! = q , or (2^ - \fl^ = {2q - l)q^-^ . 

Hence, by the definition of (fnio), we get 



hn = Tniqn) = Qn - (‘^Qn “ 1)C ^ = Qn “ 9n) • 

The function M[t) = f”'“^(l — t) achieves its maximum on [0,1], which is equal to Mn = ^ (l — 

at tri = 1 ~ Hence, pin < Mn and the conditions of Theorem 15.101 hold. ■ 

Theorem 15.101 together with Corollary 15.111 are the key results as the following theorem shows. We 
are convinced that the reason for the error threshold effect is geometric. More precisely, in view of 
p.Sp the polynomial FA„{2q — 1) can be always represented in the form 

n 

F^„(2« - 1) = ,” + /!”> (1 - (5.29) 

k=l 


where fj^^ = if{b G An \ Hab = k}. Thus, this polynomial can be viewed as a kind of the spherical 
growth function of the orbit An with respect to an arbitrary fixed point a G An. 

Theorem 5.12. In the above notation assume that for any n > uq an admissible sequence of Gn-orbits 
An C Xn {Gn ^ Iso (Xn)), u > Uq of the moderate growth is given and u > 0. If either 


[n/2\ 

lim max fif^^ (1 

n^ooqe[0.5,l] ^ 

k=l 




or 


lim 

n—>-oo 


Ln/2J 

E/- 


(n) 

k 


k=l 


n 



then the sequence {An)n>no possesses the threshold-like behavior. 


(5.30) 


(5.31) 
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Proof. The polynomial Pl^\q) = (1 — q)^q'^ ^ decreases on the segment [0.5,1] \i n < 2k < 2n 
and achieves its maximal value at g = 0.5. If 1 < A: < [n,/2j then the maximal value of 
PjTHq) = (1 “ q)^q'^~^ on [0.5,1] is achieved at the point = 1 — ^ and is equal to (l — ^. 

,[n/ 2 j f{n) Corollarv l5.1H (I5.28P and (15.291) together yield 


Denote K = inax 
qe[0.5,l] 


FAA‘^q -l) = q^ + Y,fk\'^- 

k=l 


<q^ + Fn+ Y. 


An) 

Jk 


< {2q - l)’"/2 ^ 


1 


n 


1 - 


1 


n 


n—l 


F Fn + 


fc=[n/2j+l 

^ = (2,-1)”/=+ 0(1) 


(5.32) 


n 


oo. 


On the other hand, if the equality (I5.3ip holds we can substitute (l — ^ for 

Fn since fj^'^ (n)^ “ n)^ ^ ■ Hence, Theorem 15. 101 implies that the sequence 

shows the threshold-like behavior. ■ 

Corollary 5.13. The following sequences of orbits possess the threshold-like behavior: 

(i) All the constant sequences An = A {see, for instance, Example \4.6\ of the quaternion landscape); 

(ii) All the antipodal sequences An = {a, a*} C Xn {see Example 

{Hi) All the permutation invariant sequences Ap^n where Ap^n = {a G X | Ha = p} , p = 0,1,... ,n, 
and p does not depend on n> p {see Examples \4.1\ and E3- 

Proof, {i) Let A C X„p. Since the orbit is fixed then for n > no all the coefficients 
fj^^ = 0 when k > no. It follows that the assumption (I5.31|) of Theorem 15.121 that 


Lno/2j 


lim Y fi 


(no) 

k 


k=l 


1-^ 

n 


n—k 


= 0 


holds since k < [no/2j and, consequently, (^)*'^0asn—s-oo provided (l — ^<1- 

{a) In this case FA„{q) = q'^ + {1 — (?)"■. Then = 0, k = 1,..., [n/2j. Then both assumptions 
(lom . (IQB hold. 

{Hi) We may suppose that n > 2p = no. In view of (|4.2h 




k=0 


n — p 
k 


(1 - q)^'^q 


2k ^n—2k 


Hence, since p is fixed, (|) < 2^, 1 — ^ < 1: 

P /._\ /._ ._\ / 7 _ \ 2/c / o 7 „\ n—2k 


E 


p\ n — p 


k 


i-“) 

n J 


< 




n 


2k 


k=l / \ / \ / k=l 

as n ^ OO. Consequently, the condition (|5.3in is satisfied. 


n — p 
k 


< 


(2pA’'f2 


n 


k=l 


k\ n^^ 


= 0 ( 1 ) 
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Figure 5.4: The leading eigenvalue w{q) versus the mutation rate q in the case of the fitness landscape 
in Example 14.61 The sequence length 2n = 50 in the left panel and 2n = 100 in the right panel. Note 
the absence of the threshold-like behavior 


Remark 5.14. In contrast, if = An^ 2 n is the sequence of the fitness landscapes in Example 
14.61 then the numerical calculations (Eig. 15.41) show that this sequence does not demonstrate the 
threshold-like behavior. The approximate formula (|4.19|) provides a lower bound 


n / \ 2 

max yfn {1-qf’^q^^-'^^ >0.183 
<?e[0.5,i] ^ V V 


— - r =--W ( - 

Sr-y/^ ’ 4 \ 3v^/ 


1.7423, 


where W{z) is (a branch of) the Lambert W function {W = z). Hence, the sufficient conditions 
for the threshold-like behavior are not satisfied in this case. 


A natural question to ask is whether the given sufficient conditions are also necessary for the 
threshold-like behavior. While at this point we do not have a full answer for this question, we can 
present a sufficient condition for the absence of the threshold like behavior of the sequence (?I^”'^)n>no 
as n —>■ oo. This sufficient condition shows in a way that the condition (I5.30p is “almost” necessary 
for the error threshold. 

Proposition 5.15. Suppose that there exist constants e > 0, x > 0 such that for all n >> uq the 
inequality 

FAS‘^qn-l)>{u + l){ql + 2e), = 1 - ® , (5.33) 

n 

holds for sufficiently small u > 0. Then the sequence {An)n>no possesses no threshold-like behavior. 

Proof. We can assume that x < log for sufficiently small u > 0 and > 0.5 for sufficiently large 
n. In view of (15.2p 

OO / 4- 1 \ 

n^^\qn) = FaMQu - 1) + E ( j - 1)^+1) > FaMQu -1)>{u + 1)(C + 2^). 

Hence, 

Ln{x) = log(u-b 1) - log {qn) < “ log(g" -b 2s) = - log (^(l “ 0 + 2e) < - log(e“^ -b e). 
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for n S> no- Consequently, limsupLji(x) < — log(e ^ + e) < x. ■ 

n^oo 

Note that in Remark 15.141 we can take x = r ^ 1.7423, u < 0.1, e = 0.01, n > 4. 

6 General construction for the Eigen evolutionary problem 

The classical Eigen quasispecies model uses as the underlying geometry the V-dimensional hypercube. 
The distances between the vertices of this hypercube are measured by the number of edges connecting 
them. While this geometry has a transparent biological interpretation in terms of sequences composed 
of zeroes and ones, which can be identifies with, e.g., purine and pyrimidine, we feel that it is a natural 
generalization to consider an arbitrary isometry group acting on an abstract metric space to move to 
a next level of abstraction of the quasispecies model (a somewhat relevant discussion of the original 
Eigen model can be found in [9l|T6]). This section provides a concise description of such generalization. 
While we concentrate here on the mathematical development of the model, we would like to note that 
an abstract construction of a simplicial fitness landscape can be used to model real biological systems, 
in particular the switching of the antigenic variants of some bacteria [T]. 

6.1 Groups of isometries and a generalized algebraic Eigen quasispecies problem 

The previous results, when we encode individuals of a population by the vertices of the binary cube 
X = {0,1}^ equipped with the Hamming distance, can be generalized as follows. Let {X, d) be a 
finite metric space. We will assume that the metric d: X x X —>• Nq is an mfe^er-valued function. 

Consider a group T ^ Iso(X) of isometries of X and suppose that T acts transitively on X, that is, 
X is a single T-orbit (we use the notation for the left action). Since T acts transitively on X we can 
fix an arbitrary point xq G X and consider the function dxQ ■ X —> Nq such that dx^ix) = d{x,xo). 
By definition, 

diam(X) := max{da,g(x) | x G X} 

is called the diameter of X. The number N = diam(X) does not depend on the choice of xq. 

Let us point out a few of important general geometric examples. 

Example 6.1 (Weyl chamber systems). Let T = W be the Weyl group of the root system A of a 
simple finite-dimensional Lie algebra g over C acting on the Weyl chamber system X (see [3], chapter 
VI). Eor instance, if A is of type An then W = Sn- The distance between two chambers x, y is the 
minimal number of chamber walls we need to pass from x to y. It is known (e.g., see [3j, chapters IV, 
V) that d{x^ y) is just the length of the unique element w gW such that y = wx when W is viewed 
as a reflection group generated by a set S of reflections which correspond to fundamental roots (see 
more general Example 6.3 below.) The number N = diam(V) is known as the Coxeter number of W 
and |V| = |IV|. 

On the other hand, the Weyl group W = Sn of type An acts on the V-dimensional regular 
simplex, the Weyl group W of type Bn (or Cn) acts on the A-dimensional cube since the root lattice 
is cubic in the latter case. Thus, we come to the next class of geometric examples. 

Example 6.2 (Regular polytopes). Let X = be the the set of vertices of an n-dimensional regular 
polytope P (see, e.g., [7] and Fig. EU), all edges of which have an integer length e, say, of a regular 
m-gon (m > 2) on the plane, of a tetrahedron, octahedron or icosahedron in the 3-dimensional space 
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Figure 6.1: Examples of regular polytopes in dimension 3: tetrahedron (regular simplex), cube, octa¬ 
hedron 

(see Fig. 16.11 for some examples) and so on, equipped with the “edge” metric: the distance between 
X and y is the minimal number of edges of P connecting x and y multiplied by e. For n-dimensional 
unit cube the edge metric is the same as the Hamming metric. 

The group of all isometries F = Iso(P) acts on P and, consequently, on X. For instance, let P be 
an icosahedron or dodecahedron. Then F = ^5 where < S 5 is the alternating group of order 60. 

Example 6.3 (Groups as metric spaces). Let G be a finite group generated by a set 5 = 5“^. The 
word metric d = ds on G is defined as follows (see [ 8 ], chapter IV for more details and examples): 
d{g,h) = l{g~^h) where l{g~^h) = I is the minimal number of generators s £ S needed to represent 
g~^h as a product si... s;. The word metric is invariant with respect to the action of G on itself by 
left shifts h —>• gh. Hence, we have the metric space X = G and the transitive action of F = G on X 
by isometries. 

More generally, for any subgroup H ^ G we can define the metric space Xh = {gH \ g G G} of 
the left cosets of G by H. The group G acts on Xh by left shifts and 

d{gH, aH) = min{d(x, y)\x £ gH^ y £ aH} . 

If G acts transitively by isometries on a metric space X then as a G-set X is isomorphic to the set 
of left cosets G/Str(a:o), xq £ X. 

Example 6.4 (p-adic metric spaces). Let p be any fixed prime, Zp be the commutative ring of p-adic 
integers equipped with the standard p-adic metrics dp{x,y) = ||x — p||p. Consider the quotient rings 
Xn,p = Zp/p”Zp, n G N, with the scaled metric dp{x,y) = P^~^\\x — y\\p {x denotes the coset x+p'^Zp) 
on which the additive group F = F^^p = V„,p acts 1-transitively by isometries L- : x —)• 7 -|- x. Here 

P^n,p — diam(V^ p) — P , ln,p — |^n,p| — P • 

For p = 2, n = 3 we have 2-adic “cube” ^ 3^2 which is different from the binary cube with the 
Hamming metric. 

Now consider a quadruple {X,d,T,w) where {X,d) is a finite metric space with integer distances 
between points of diameter N and cardinality I = |V|, a group F ^ Iso(V) is a fixed group and a 
fitness landscape w = {wx)~^ is a vector-column of non-negative real numbers called fitnesses indexed 
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by X G X. The quadruple {X,d,T,w) will be called homogeneous T-landscape. In other words, the 
sequences of the population are encoded hy x £ X. 

Consider also the diagonal matrix W = diag(u;a;) of order I called the fitness matrix, the symmetric 
distance matrix D = with integer entries of the same order and the symmetric matrix 

Q = ((1 — for q G [0,1]. Finally, we introduce the distance polynomial 

Px{q) = X] (1 - , xo G X (6.1) 

xex 

Since F acts transitively on X this polynomial is independent on the choice of xq G X and is the 
sum of entries in each row (column) of Q. 

The following definition generalizes the classical Eigen quasispecies problem we dealt with in the 
previous sections. 

Definition 6.5. The problem to find the leading eigenvalue w = w{q) of the matrix -p^j^QW and 
the eigenvector p = p{q) satisfying 

QWp = Px{q)wp, Px=Px{q)>0, '^px{q) = l (6.2) 

xex 

will be called the generalized algebraic Eigen quasispecies problem. 

Note that in (j6.2p 

WxPx ■ (6.3) 

x&X 

Due to the Perron-Frobenius theorem a solution of this problem always exists. Also note that the 
uniform distribution vector 

p=^{i,....ig = (6.4) 

provides a solution to (16.2p in the case of the constant fitnesses Wx ^ w > 0. 

The problem (|6.2I) turns into the classical Eigen evolutionary problem for the X-dimensional binary 
cube X = {0,1}'^ with the Hamming metric and F = Iso(A) which was named in 1930 by A. Young 
a hyperoctahedral group. F is isomorphic as an abstract group to the Weyl group of the root system 
of type Bn or Cn and is acting on the cube. In the classical case Px{q) = 1- 

Consider also the following serial examples. If X is the set of vertices of an n-dimensional regular 
simplex with all edges of unit length then F = Iso(Y) = 5n,+i, N = diam(A) = 1 and I = \X\ = n + 1. 
The distance polynomial is 

Px{q) = q + n{l - q). (6.5) 

If X is the set of vertices of an n-dimensional hyperoctahedron with all edges of unit length then 
F = Iso(X) is again a hyperoctahedral group (the hyperoctahedron is the dual polytope to the cube), 
N = diam(X) = 2 and I = |Y| = 2n. The distance polynomial is 

Px{q) = q'^ + (2n - 2)(1 - q)q + (1 - qf . (6.6) 
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6.2 Properties of the distance polynomial 

In the notation of Section [Q consider the polynomial Px{q) = Px,d{Q)- The polynomial Px{q) is 
strictly positive on [0,1] (if N is strictly equal to diam(X). If > diam(X) then Px{0) = 0, such 
cases sometimes we will need to consider) and possesses the following properties: 

1 . 

Px{l) = 1, (^) = ^ = ^ • (6-7) 

2 . 

N 

Px{q) ^ ’ (6.8) 

k=0 

where the non-negative integers fk = fkiX) := #{x G X \ d{x,xo) = k} are the cardinalities of 
d-spheres in X with the center at the fixed point xq and of radius k. 

Remark 6.6. The polynomial Sx{t) = often called the spherical growth function of 

{X,d). See, for instance, [ 8 ], chapter VI for details and examples. 

Suppose that we scaled the metric d by a positive integer factor e. Let Px,e-d{q) denote the new 
distance polynomial. Then 

N 

PxMq) = E (1 - =Y^f„{l- qy^qe(.N-k) ^ ^ ^ . (6.9) 

x&X k=0 

Since q G [0,1] we may assert that the sequence {Px,e-d{Q) | e G N} is non-increasing at each fixed 
point g G [0,1]. 

6.3 Regular simplicial fitness landscapes 

To give a specific example of the analysis of the generalized algebraic Eigen quasispecies problem 
we shall briefly consider two-valued fitness landscapes related to the set of vertices of the regular 
n-dimensional simplex X with Iso(V) = Here we follow the main lines of Section |5J 

Biologically, the simplicial fitness landscape means that we deal with a population of individuals 
such that any individual can mutate to any other individual with the same probability equal to 1 — q. 
Even such oversimplified construction can model a non-trivial biological system. Here, for example, if 
we consider “mutation” as a sudden discrete genetic (heritable) change then the simplicial geometry 
can describe, at a first approximation, the switching of the antigenic variants for some bacteria. These 
variants turns one into another with almost equal probabilities, whereas the corresponding htnesses 
of different variants are defined by interactions with the host immune system (e.g., m)- 


6.3.1 General scheme 

Let X = {0,1,... , n} and d{i,j) = 1 if i / j, d{i, i) = 0. Hence, V is a metric space with the trivial 
metric, N = diam(X) = 1 and I = jVj = n + 1. The distance polynomial is defined by ()6.5p . 

Let H C {0,1,... , n}. Consider the landscape 

f rc -|- s, k G A , 

= [ w, k^A. 
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The matrix W of fitnesses can be represented as follows 


W = wl + sEa = wl + S Ea, 

aeA 


( 6 . 10 ) 


/ being the identity matrix and Ea being the elementary matrix with the only one nontrivial entry 
Saa = 1 on the diagonal. 

We want to solve the problem (|6.2p . The matrix Q = {Q^a) = (2? — 1).^^ + (1 “ q)E where all the 
entries of E are ones, that is 

l-q, bj^a, 
q , b = a . 


Qba ~ 

It can be directly checked that 

D{n, q) := T~^QT = diag(g + n(l - q),2q - I,... ,2q - 1), 
where for the symmetric transition matrix T of order n + 1 we have 


T = 


1 

-1 

0 .. 

0 

^-1 1 

1 

— n 

1 .. 

1 

1 

0 

-1 .. 

0 

1 

1 

—n .. 

1 





’ n + 1 








0 





1 

V 1 

0 

0 .. 

■ -1 / 


V 1 

1 

1 .. 

. -n / 


1 \ 


The transformation of (16.2p yields 

T-^QTT-^WTT-^p = Pxiq)wT-^p = (q + n(l - g))uJT“V 
or, in view of (j6.1ip . 


( 6 . 11 ) 


( 6 . 12 ) 


D{n, g) Ul + s T-^EaT T'^p = Px{q)wT-^p, 
\ a&A J 

whence 

{Px {q)wl - wD{n, q))T~^p =^D{n, q)T~^EaP. 

a^A 

Let X = p = Tx. Then ()6.14p implies 

X = s'^{Px{q)w I - D{n,q)))~^D{n,q)T~^EaP, 

aeA 


or, in coordinates, 

Din,q)kA~a^^Pa 
^ ~^Px{q)w-wD{n,q)k 

Since p = Tx, then we get from (|6.15p 


k = 0 ,... ,n. 


p = s ^ T{Px{q)wI - D{n, q))-^D{n, q)T-^ EaP. 
aeA 


(6.13) 

(6.14) 

(6.15) 

(6.16) 

(6.17) 
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Only the components pa, a G A, are involved in the right-hand side of ()6.17p . By definition, 
Ea = 'Yla^A^o- ^ projection matrix. Hence, we can multiply both sides of (|6.17l) by Ea- 

EaP = sEAT{Px{q)wI - D{n, q)))~^D{n, q)T~^ EaP- (6.18) 

We can rewrite (|6.18p as 

Pa = Mpa, Pa = EaP , (6.19) 

where 


M = sEa T{Px{q)wI - D{n, q)))-^D{n, q)T 


-1 


E 


wPx{q) \ wPx{q) 


c=0 

oo 


w 


Ea TD{n,qy+^T-^ 


( 6 . 20 ) 


E i? 


wPx{q) ^ \ wPx{q) 


w 


Ea Q 


c+l 


It follows that vector Pa is an eigenvector of M corresponding to the eigenvalue A = 1. 

Consider w in (j6.19p . (16.201) as a parameter. It follows from (j6.3n that W depends only on o € ^, 
that is, on the “reduced” vector Pa = EaP- The original eigenvector p can be reconstructed from 
Pa with the help of (j6.17p . Thus, instead of the original problem we arrive to the reduced problem 
to find the eigenvector Pa satisfying (16.181) and corresponding to the eigenvalue A = 1 of the matrix 
M = (mba) defined in (I6.20|) b The parameter w = w{q) satisfies the formula 


w = w 


Ps'^Pa- 


a&A 


( 6 . 21 ) 


6.3.2 Equation for the eigenvalue w 
Lemma 6.7. In (j6.20p we have 


C+l 


= (2q-iy+^I+Y, 


m=l 


c + l 


m 


(n + lT-\2q - l)"+i-”*(l - q^E . 


( 6 . 22 ) 


Proof. We apply directly the binomial expansion for the matrix = {{2q — 1)1 -|- (1 — q)E)^~^^. 

Since all the entries of E are ones then = (n -I- 1)E and, consequently, E™ = {n + l)'^-^E. m 


The equality (16.191) implies that 


m-baPa = '^Pa'^ ^ba- (6.23) 

b&A b£A aeA aeA b£A 


Suppose that the sum ^ mba does not depend on a G A. Then it follows from (16.231) that ^ mba = 1 

b£A _ b£A 

for each a & A. In view of ()6.20p 


wPxjq) 

s 



w 

wPx{q) 


Y,{EAQ^^\a, 

b£A 


if the inner sum does not depend on a £ A. 


(6.24) 
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Theorem 6 . 8 . In the previous notation let A be a subset of a simplicial metric space X. Then the 
equality 


Lq - ^ V 


c + 1 


m 


{n + ir{2q-lY+^-^{l-qy 


(6.25) 


holds. 


Proof. Since Iso(X) = Sn+i acts (n + 1)-transitively on X 
{ 0 , 1 ,... , |A| — 1 } on which the cyclic subgroup C|^| = (( 0 , 1 , 
IKm to (IblWll . 


we may assume that A is the subset 
... , |A| — 1)) is acting. Then we apply 


The formula (16.25^ can be simplified as follows. Recall (see (I6.5j) l that Px{<l) = <1 + n{l — q). The 
binomial expansion yields 


c+l 

E 

m=l 


C+l 


m 


(n + l)”^(2g-l)^+^-™(l-g)™ 


= {{2q - 1 ) + (n + 1)(1 - q)r+^ - {2q - = Pxiqf^^ - {2q - 1 )^+^ . 

Hence, ()6.25l) reads 


wPx{q)_^f w X((, 1^c+l , l-4| „ 


^E 


■s + \ wPx{q) 


1 - 


(2q-lf+^ + ^Px{qr 

n+1J n+1 


(6.26) 


Summing the geometric progressions we finally get 


1^1 


(n -|- 1 )(m — u) 


+ 1 - 


1^1 


2 q - 1 


w w , , 

I / / ^ = 1) u = ~) u = — . (6.27) 

n-|-1 y {q + n{l — q))u — {2q — l)u s s 


Remark 6.9. Note that the equation (I6.27|) depends only on |H| and the dimension n. It follows that 
(j6.27l) provides the eigenvalue of the two-valued fitness problem ()6.2I) for any subset A C X. Note also 
that the equation (I6.27j) turns into the equation of degree 2 = iV -|- 1 where = 1 is the diameter of 
the simplex (compare with Corollary 13.3p . We expect that for the hyperoctahedral landscapes we will 
get cubic equations since N = 2 for a hyperoctahedron (with unit edges) in any dimension. 

The solution to (I6.27P is given by the following formula^ 


u = 


){q) + \/v‘^{q) - A{u u^){2q - l)(g' n(l - q)) 
2 [q + n{l-q)) 


v{q) = {q + n{l - q)) [u+ ^ + {2 q-l) (u + 1- 


n -|- 1 


n -|- 1 


(6.28) 


^This is a correct formula, unfortunately, in the published version the factor (q + n(l — q)) is missing 
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6.3.3 Simplicial error threshold 


In this subsection some results of Section [S] are appropriated for the case of the simplicial landscapes. 

Let Xn be the set of vertices of an n-dimensional regular simplex with edges of unit length and let 
(^n)^no’ ^ be a Sequence of subsets. Let u = u^'^\q) be the sequence of the corresponding 

eigenvalues (see (16.281) 1. It can be checked that each function is increasing on the segment [0.5,1] 
and convex downward. 

A sequence {An)n>no is called a sequence of the moderate growth if 


lim 


IA 


= 0 . 


n—>oo n + 1 

Let us denote In view of ()6.28p ui”i(0.5) = u + Or, 


(6.29) 


u as n —)• oo if the sequence 
(^n) n>no is of the moderate growth. On the other hand, = n + 1. 

Consider new coordinates x, L such that 

u + 1 


X n 

9 = 1-, 0<x<-, 

n 2 


L = 


u 


- 1, 0 < L < 1/u. 


We assume that tt > 0 in (|6.30p . The curve u = transforms into the curve 

u + 1 


L = Ln{x) = — 


(n) (1 _ 


- 1 . 


(6.30) 


(6.31) 


Definition 6.10. We say that a sequence (A„)))T„^ of the moderate growth, or, equivalently, the 
family (ui”i)))T„^ possesses the threshold-like behavior on the segment [0.5,1] if for each fixed x > 0 
and the corresponding functions Ln{x) it it true that 


lim Ln{x) = L{x) = < 


1 

X, 0 < X < — , 
u 

1 1 

—, X > — . 

u u 


(6.32) 


The following formula provides an approximation for the error threshold value (if exists) q^\u), 
n > 1: 


qi"'^ (u)«l-^ = 1- 

nu nw 

Theorem 6.11. In the above notation suppose that a sequence of subsets An C Xn of the moderate 
growth is given and n > 0. Then the sequence {An)’^=nQ shows the threshold-like behavior on the 
segment [0.5,1]. 


1 


Sketch of a proof. In view of the equation (|6.27l) . in coordinates x, L: 


®n(l T I'n{s^)') 


+ 


(1 - an){l - 2x/n) 


1 -uLnix) (1 + X - x/n) - (1 - 2 x/n) u 


= 1 


(6.33) 


The existence of lim Ln{x) for a hxed x, 0 < x < 1/u, can be proved with the help of lower and 

n^oo _ 

upper estimates. If n —>• oo in (|6.33p we get (since = \ An\/{n + 1) —)■ 0) 

1 


(1 + x) 


l+u 

1+L(x) 


— U 


= 1 , 
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Figure 6.2: Illustration to Definition 16.101 


or L{x) = X on [0, l/u). Since Ln{x) increases with respect to n and cannot exceed the value 1/u we 
obtain the desired result. ■ 

In a similar fashion other geometric examples can be analyzed. 


7 Concluding remarks 

There are two main points to emphasize in order to conclude the presentation. First, in this text we put 
forward general, rigorous, and quite elementary methods to analyze two-valued fitness landscapes in 
the classical quasispecies model. While a great deal of analysis of this problem in the existing literature 
was inspired by the analogies with the famous Ising model of statistical physics, we show that direct 
methods of linear algebra allow gaining full understanding of the properties of the selection-mutation 
equilibrium in this model at least in some special cases. 

Second, the language of the group theory gives us an opportunity to look at the phenomena 
associated with the quasispecies model from a more general and abstract point of view. In particular, 
the infamous error threshold can be looked at from the position of the external and internal metric 
properties of orbits. If the set of population sequences is enumerated by points of a finite metric space 
X with integer-valued metric d on which a group F acts transitively by isometries then we can involve 
group theoretical and algebraic tools in order to obtain not very complicated solutions for the leading 
eigenvalue problem, at least in the special case of the two-valued fitness landscapes. Such a classical 
approach is in accordance with the well known F. Klein’s Erlangen program. We are convinced that 
this connection between mathematical biology, finite geometries, combinatorics and algebra confirms 
the importance of Eigen’s model from various viewpoints. 

To reiterate, in the general case we consider a quadruple {X, d, F, w) — homogeneous F-landscape 
— with the fitness function w: X —>■ R>o. The information of the geometric properties of the 
underlying metric space {X,d) is contained in the symmetric matrix Q = ((1 — ^ 

q € [0,1], where N = diam(X) is the diameter of X. The diameter N as well as the cardinality 
I = jXj are the two main numerical characteristics of the model {X, d, F, w). The distance polynomial 
Px{q), which is the leading eigenvalue of the matrix Q, plays the key role in the analysis. For the 
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classical Eigen’s quasispecies model X = {0,1}^ is the binary cube with the Hamming distance, 
diam(X) =N,l = \X\= 2^ and Px{q) = 1- 

Suppose that we have a subgroup G ^ E which also acts on X and the fitness function w is 
constant on the orbits of G-action. We saw in Sections El El that for the two-valued fitness functions 
w{A) = w+s, w{X\A) = w, A being any G-orbit, the degree of the equation on the leading eigenvalue 
can be reduced from I to N + 1. Although the solution of the leading eigenvalue problem appears in 
an implicit form we are able to obtain lower and upper bounds for it. 

We can also consider sequences of metric spaces Xn and orbits An as n —>■ oo. Usually we have a 
chain 

XnoC-- - CXnC Xn+1 C ■ ■ ■ C [J Xn = X^o ■ 

n 

The analysis presented in the main text allows to conjecture that the error threshold, i.e., non-analytical 
behavior of the leading eigenvalue w in the infinite sequence limit, occurs when the cardinalities \An\ 
grow not rapidly enough comparing with the growth of \Xn\- For instance, when An = A, where 
A contains a single point (the single peaked landscape), or is a fixed constant set (orbit) then the 
threshold-like behavior is observed. This topic is to be investigated in the general situation. 

At the beginning of Section [6] we pointed out the most interesting geometric examples of groups 
and metric spaces for which the generalized Eigen’s algebraic problem could be solved. Among them 
are the Weyl groups acting on the chamber systems (the reflection groups should be added) and 
groups of symmetry of regular polytopes. Example 16.31 deals with all finite groups in general. It 
is very possible, and genuinely intriguing, that some infinite finitely generated groups (free groups, 
non-Euclidean crystallographic groups and others) can be included in the list of groups for the future 
research (see, for instance, mm)- 

A Proof of Proposition 15.5 

The following three lemmas and corollary provide the full proof that all the examples in Section E] deal 
with admissible sequences of orbits of the moderate growth (Proposition 15.5ll . 

Lemma A.l. Let A C Xn^ be a fixed G-orbit. Consider the constant sequences An = A and Gn = G, 
n > no- Then the sequence [An)^=no admissible. 

Proof. Since the orbit is not changing as n —)• oo then it follows from (j3.8p that 

FAfl 2 q - 1) = q^-^°FAn, (2? - 1), qG [0,1]. 

The polynomial PA^i^q — 1) > 0 and > q^-^-i-no Hence, (|5.10l) holds. ■ 

Lemma A.2. Let a„ E Xn, a* = 2” — 1 — an, and An = {a„,a*}. Let Gn = G = {l,g} be the group 
of order 2 such that g{a) = a* for any a E A„. Then the sequence (A„)^„jj is admissible. 

Proof. In view of (|3.12l) and (I4.20p 

FAni2q -l)=q^ + il- q)^ > q^+^ + (I - g)"+' = Ea„+i( 2(? - 1), g E [0,1]. 
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Lemma A. 3. Let p he a fixed number, n > uq = 2p. Let = An^p = {a G \ Ha = p}- Then 
{An)'^=no admissible sequence. 

Proof. It follows from (|4.18p that 


fc=o 


F ^„(25 - 1) = (n ^ ) (1 - =: f„,p(5) . 0 < p < «. 


(A.1) 


At the same time consider the polynomials 

p 


Gn,pisi) 


k=l 


p\ n — p 


k \k — \ 


(1 - 


2k ^n-\-l—2k 


(A.2) 


By definition, Lo,o(^) = 1) Gnfi{q) = 0. Applying the binomial formulas i/) + (fc_i) to 

(lA.ll) and (^) = + (^ij) to ()A.2p we get the following recursive relations: 

-^n+l,p(i7) — QTn,p{q) Gn^p(^q') , Gn+l,p{q) — (1 q) Tn,p—l(,Q') “b QGn,p—liQ') ■ 

When we substitute the left-hand-side of the second formula (IA.3D into the first one (with the 
change n ^ n — 1 ) and then iterate such substitutions we get 


p 

^n+i,p{Q) — (lTn^p{q) -|- (1 — g) ^ ^ q^ Fn—j,p—j{q) ■ 

i=i 

In the same way the equality 

p 

Gn+i,piq) = (1 - 

i=i 


(A.4) 


(A.5) 


can be obtained. 

Formulas (IA.3P imply also that 


Tn,p{si) ^n+l,pidl) — (1 Q)^n,p{si) Gn,p{q) — (1 Q)Q.Tn—l,pidl) “b Q)Gn—l,pidl) Gn,p{q) , 


or 


K,p(q) - Fn+i,p(q) = q((l- q)Fn-i,p(q) - Gn,p(q)) + (1 - q) (Gn-i,p(q) - Gn,p(q)) • (A. 6 ) 

Our objective is to prove that 

T2p-f-k+l,p (q) — T2p-f-k,p(q) ) k t) , O' G [0,1] . 

We will proceed by induction on p and, for a fixed p, by induction on k. 

First of all, the case p = 0 is trivial since Fnfl{q) = q'^. 

Let p > 1 be fixed and let k = 0. Substituting n = 2p into pA. 6 p we get 

T' 2 p,p{q) ~ F 2 p.\-i^p{q) = q ((1 — q)F 2 p—i^p{q) — G 2 p,p{q)) -b (1 ~ O') {G 2 p—i,p{q) — G 2 p,p{q)) ■ 


35 














Let us show that both summands in the right-hand side are nonnegative on [0,1]. On the one hand, 
by definition we have Fn,p{q) = Fn^n-p{q)- Then in view of (IA.3P 


(1 q)F 2 p—i^p{q) G 2 p,p{q) — (1 q)^ 2 p—i,p—i{q) G 2 p^p{q) 

(f q)^ 2 p—i,p—i{q) ((1 q) ^ 2 p—i,p—i{q) t qG 2 p—i,p—i{q)) 

= g(l - q)F2p-i,p-i{q) - qG2p-i,p-i{q) 

= 9(1 - q)F2p-i,p-i{q) - q{F2p^p-i - qF2p-i^p-i{q)) 

= q{F2p-i,p-i{q) - F2p,p-i{q)) > 0 , g € [0,1], 

by the inductive hypothesis. 

On the other hand, it follows from (IA.5I) that 

p 

G2p-i,p{q) — G2p,p{q) = {1 — q) q^ {F2p-j,p-j{q) — F2p+i-j,p-j{q)) > 0 

i=i 

on [0,1] by the same reasons. This finishes the proof for the case k = 0. 

Let A: > 1. Then by virtue of (IA.4D we can assert that 

F 2 p+k,p{q) ~ F 2 pj^k+l,p{q) — 

p 

= q[F 2 p-\-k—i,p{q) ~ F 2 p-\-k,p{q)) + (1 q) ^ ^ q^ {F 2 p+k—i—j,p—j(si) ~ F 2 p-i^k—j,p—j{q)) ^ 0 

i=i 

on [0,1] by the inductive hypothesis. The lemma is proved. ■ 

Corollary A.4. Let A 2 n = A 2 n,n = {a G X 2 n \ Ha = n}. Then (A2n)^no admissible sequence. 

Proof. In the notation of Lemma lA.3l let us prove that F 2 n,n{q) < F 2 n- 2 ,n-i{q) on [0,1]. From 
the first formula (IA.3P we can find the expressions Gn+i,p{q) = Fn+ 2 ,p{q) — qFn+i,p{q)-, Gn,p-i{q) = 
Fn+i^p-i{q) — qFn^p-i{q) and substitute them into the second one. The simplihcation yields 

Fn+2,p{q) — (1 ^q')Fn^p—l(^q') T qFn+l,pi(l) T qFn+i,p—ii.q^ ■ 

Consequently, choosing appropriate values for n, p in this formula, we get 

F2n—2,n—liq) F2n,n{,q) — q{,F2n—2,n—l{,q) F2n—l,n{.qf) T ^(^ 272 —2,n—1 ( 9 ) F2n—l,n—l{,q)^ • 

But in view of ()A.1|) Fn,p{q) = Fn,n-p{q) for all n and p, 0 < p < n. Hence F 2 n-i,niq) = F 2 n-i,n-iiq) 
and it follows from Lemma lA.31 that 

^2)1— 2 , 71 — 1 ( 9 ) F2n,n{q) ‘^q{H2n—2,n—l{q} H2n—l,n—li.qy) — 0 
on the segment [0,1]. ■ 
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